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(54) Title: A METHOD FOR ESTIMATING MOTION IN A VIDEO SEQUENCE 
(57) Abstract 

Efficient and adaptable motion estimation between frames in a video sequence is achieved in a 
video compression system (100) by first performing a series of comparisons in a first region between 
a present video block of a present video frame and a comparison video block of a previously stored 
video frame based on a first search point displacement pattern (500). Then, similar comparisons are 
performed in a second and third region based on a second and third search point displacement pattern 
respectively (501, 502). A best match video block is determined as a result of the series of comparisons 
(503). The series of comparisons may be made for a predetermined number of video blocks (600) 
and video statistics may be collected for the comparisons (601). If the video statistics are unfavorable 
(602) or if a predetermined time has expired (603), then the region used for searching for a best match 
block may be changed. 
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A Method for Estimating Motion in a Video Sequence 
Field of the Invention 

10 

The invention generally relates to the field of video 
compression, and in particular, to estimating motion between frames 
in a video sequence. 

1 5 Background of the Invention 

Video systems are known to include a plurality of 
communication devices and communication channels, which provide 
the communication medium for the communication devices. For 

2 0 example, the communication channel may be wireline connections or 

RF frequency carriers. To increase the efficiency of the video 
system, video that needs to be communicated is digitally compressed. 
The digital compression reduces the number of bits needed to 
represent the video while maintaining perceptual quality of the 

2 5 video. The reduction in bits allows more efficient use of channel 

bandwidth and reduces storage requirements. To achieve digital 
video compression, each communication device may include an 
encoder and a decoder. The encoder allows a communication device 
to compress video before transmission over a communication 

3 0 channel. The decoder enables the communication device to receive 

compressed video from a communication channel and render it 
visible. Communication devices that may use digital video 
compression include high definition television transmitters and 
receivers, cable television transmitters and receivers, video 
3 5 telephones, computers and portable radios. 
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Several standards for digital video compression have emerged, 
including International Telecommunications Union (ITU) -T 
Recommendation H.261, the International Standards Organization/ 
5 International Electrotechnical Committee (ISO/IEC) 1 1172-2 
International Standard (MPEG-1), and the forthcoming ISO/IEC 
13818-2 standard (MPEG-2). These standards seek to efficiently 
represent a sequence of frames of video by exploiting the spatial and 
temporal redundancies in the video and by taking advantage of the 
1 0 perceptual qualities of the human eye. Temporal redundancy is 
exploited by estimating the motion in a sequence of frames. 

Block matching algorithms are a known method for estimating 
motion within a video sequence. Each frame of the video sequence is 

1 5 divided into blocks of pixels. Block matching algorithms compare a 

current block that is to be encoded in one frame with blocks of the 
same size in a previous or future frame of the video sequence. If a 
matching block is found in a frame that has previously been encoded, 
then rather than encode the current block independently, the location 

2 0 of the matching block and any differences between the current block 

and the matching block may be encoded. Generally, encoding only 
the location and any difference information provides a more efficient 
method of encoding. 

2 5 A full search block matching algorithm compares the current 

block that is to be encoded with all possible matches within a search 
region of a previously encoded frame. The search region is 
generally defined in terms of motion vectors (MV) that indicate a 
displacement in location from the upper leftmost pixel of the current 

3 0 block. For example, for a current block that has the upper leftmost 

pixel at an absolute location in Cartesian coordinates (x, y) of (10, 
20), a search region of MV(+/-5, +/-5) has comer points at MV(-5, 
-5), MV(-5, +5), MV(+5, +5) and MV(+5, -5) and is defined by 
absolute locations (5, 15), (5, 25), (15, 25) and (15, 15) in the 
3 5 previously encoded frame. The full search algorithm always finds 
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the best matching video block since it examines all choices in the 
search region. Therefore, the full search algorithm provides optimal 
video quality. The problem with the full search algorithm is that it 
is computation intensive. For example, for a search region of 
5 MV(+/- 48, +/- 31), 6111 comparisons are necessary. 

A hierarchical full search algorithm uses two search regions 
with different search point displacement patterns. The first region 
has a one pixel search point displacement pattern, i.e. comparison are 
1 0 made at intervals of one pixel. This search point displacement is the 
same as for the full search algorithm. The second region has a two 
pixel search point displacement pattern, i.e., comparison are made at 
every other pixel in both the horizontal and vertical directions. The 
first region is defined by MV(+/-2, +/-2) The second region is 

1 5 defined by MV(+/-6, +/-6) minus MV(+/-2, +/-2). The hierarchical 

full search algorithm reduces the number of comparisons over the 
full search algorithm and maintains adequate video quality, but is 
still quite computation intensive, especially for large search regions. 
Extending the hierarchical algorithm to cover a search region of 
20 MV(+/-48, +/-31) still requires approximately 1695 comparisons. 

Another known block matching algorithm uses a very dense 
displacement pattern around MV(0, 0) and radially decreases the 
number of comparison points to cover a search region of 

2 5 approximately MV(+/- 10, +/- 10). The problem with this algorithm 

is that it is not readily extendible to larger search regions in the 
range of MV(+/-48, +/- 31), Radially decreasing the number of 
search points causes the distance between comparison points and 
therefore, the location of comparison points to be highly irregular. 

3 0 This makes an implementation in hardware complex if not 

impossible. In addition, as the number of comparison points is 
reduced, the probability of finding a suitable match is substantially 
reduced. This causes inferior video quality. 
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The known block matching algorithms use a consistent search 
region and search displacement pattern without regard to the video 
sequence being encoded. This is problematic in that the search 
technique that is good for one sequence is not good for another 
5 sequence. For example, a video sequence, such as a car race that has 
fast motion, may require a large search region, but a sequence such 
as a talking head in a news cast, does not require a large search 
region. 

1 0 Therefore, a need exists to restrict the search region and 

search technique to one that is reasonable in the computations 
required, but provides adequate video quality. Furthermore a need 
exists to be able to adapt the search technique to the video scene 
being encoded. 

15 

Brief Description of the Drawings 

FIG. 1 illustrates a video compression system utilizing motion 
estimation in accordance with the present invention. 

20 

FIG. 2 illustrates an exemplary video source that serves as 
input for a video compression system implemented in accordance 
with the present invention. 

2 5 FIG. 3 illustrates two exemplary video frames indicating 

regions for motion estimation in accordance with the present 
invention. 

FIG. 4 illustrates example search point displacement patterns 

3 0 and regions in accordance with the present invention, 

FIG. 5 illustrates a flow diagram of a method for motion 
estimation in accordance with the present invention. 
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FIG, 6 illustrates a flow diagram for a method for adapting 
the search region used in motion estimation in accordance with the 
present invention. 

5 

Description of a Preferred Embodiment 

The present invention provides an efficient method for video 
compression by estimating motion on a video frame by video frame 

1 0 basis. This is accomplished by comparing each video block of a 

present video frame with comparison video blocks of a previously 
stored video frame. The comparison video blocks are distributed 
throughout a region surrounding the video block of the present video 
frame. In other words, each video block of the present video frame 
15 is essentially overlaid on the comparison video blocks of the 
previously stored video frame to determine where objects in the 
previously stored video frame have moved to in the present video 
frame. This overlaying is done in a series of three different 
comparison densities. The first series of comparisons, which are 

2 0 performed using a high density level (first search point displacement 

pattern), is done in a region surrounding the video block of the 
present video frame. The highest density level is used in this region 
because most objects do not move that much from video frame to 
video frame, thus it is most probable that a match will be found in 

2 5 this area. The next two series of comparisons are done at 

respectively lower density levels as the comparison video blocks are 
positioned further and further away from the video block of the 
present video frame. Once a reasonable determination of motion has 
been made, the comparison video block of the previously stored 

3 0 video frame that most closely resembles a video block of the present 

frame may be used to encode the present video block, thus reducing 
the total number of computations needed to estimate motion while 
maintaining good video quality. 
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The present invention can be more fully described with 
reference to FIGs. 1-6. FIG. 1 illustrates a video compression 
system 100 that estimates motion between video frames in 
accordance with the present invention. A video source 111 is 
5 received by the video compression system 100. For each present 
video block in the video source 111, the motion estimator 110 
searches a previous frame storage element 108, which holds one or 
more reference pictures, to find a best match video block. The 
motion estimator 110 then produces a motion vector (MV), which 
1 0 describes the location of the best match video block in relation to the 
present video block. The motion compensator 109 uses the MV to 
extract the best match video block from the previous frame storage 
element 108. The best match video block is then subtracted from the 
present video block by subtracter 101, producing a residual video 

1 5 block. The intra/non-intra determiner 113 determines whether the 

residual video block or the present video block should be encoded. 
This decision is made based on an evaluation of the present video 
block and the best match video block. If the evaluation is favorable, 
then non-intra coding is chosen, i.e., the residual video block is 

2 0 encoded. If the evaluation is unfavorable, intra coding is chosen, 

i.e., the present video block is encoded. In addition to several known 
methods for determining whether the residual video block or the 
present video block should be encoded, a preferred method is set 
forth in co-pending U. S. patent application docket number 

2 5 CR0031M, assigned to the same assignee as the present patent 

application. 

If intra coding is chosen then the present video block is sent to 
the discrete cosine transformer (DCT) 102 to be encoded. 

3 0 Otherwise, the residual video block is sent to the DCT 102 to be 

encoded. The DCT 102 then transforms the spatial domain values of 
the video block it receives into frequency domain values preferably 
using the discrete cosine transform. Alternatives to the discrete 
cosine transform include subband coding, lapped orthogonal 
3 5 transforms, and vector quantization. 
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The frequency domain values from the DCT 102 are then 
scaled and truncated by the quantizer 103, introducing quantization 
error into the compression system 100. These quantized values are 
5 then efficiently and losslessly encoded by the variable length coder 
(VLC) 104. The MV's are also efficiently and losslessly encoded by 
the VLC 104. The output of the VLC 104 is the compressed video 
112 that may be stored or transmitted to a decompression system. A 
feedback loop consisting of an inverse quantizer 105, an inverse 
1 0 DCT 106, an adder 107, the previous frame storage element 108, 
and the motion compensator 109, reconstructs the same picture that 
the decompression system will reconstruct and stores the picture into 
the previous frame storage element 108. The inverse quantizer 105 
followed by the inverse DCT 106 reconstructs the spatial domain 

1 5 values, which include the error introduced by the quantizer 103. If 

non-intra coding was chosen by the intra/non-intra determiner 113, 
then the best match video block from the motion compensator 109 is 
added to the output of the inverse DCT 106 by the adder 107; 
otherwise zero is added to the output of the inverse DCT 106 by the 

2 0 adder 107. The output of the adder 107 is then stored into the 

previous frame storage element 108. The video compression system 
100 may be implemented with software executing on one or more 
digital signal processors or general purpose microprocessors. 

2 5 FIG. 2 illustrates an exemplary video source 111. The video 

source 1 1 1 consists of a series of video frames 200. Each video 
frame 200 is composed of a two-dimensional array of pixels 202. 
Common sizes for the video frame 200 include 352 pixels 
horizontally by 240 pixels vertically, referred to as Standard Image 

3 0 Format (SIF), 352 by 288 pixels, referred to as Common 

Interchange Format (CIF), and 176 by 144 pixels, referred to as 
Quarter CIF (QCIF). The video frames are commonly separated in 
time by 1/24, 1/25, or 1/29.97 seconds. Each video frame 200 can 
be divided into regions, where each region is referred to as a video 
3 5 block 201 . In the H.261 , MPEG- 1 , and MPEG-2 standards the video 
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block 201 is called a macroblock and consists of a 16 by 16 array of 
pixels. 

FIG. 3 illustrates a present video frame 300 and a previous 
5 video frame 301. The motion estimator 1 10 determines the motion 
of the present video block 302 from the previous video frame 301 to 
the present video frame 300 by searching three regions 303, 304, 
305 of the previous video frame 301 around the location of the 
present video block 302. The shape of the three regions 303, 304, 
10 305 are based on the MV probability distribution for typical video 
sources 111. A high percentage of MV's are near zero, i.e., very 
little motion, so the first region 303 provides a dense search point 
displacement pattern to make sure that an excellent estimate is made 
for most of the MV's. The second region 304 provides a moderately 

1 5 dense search point displacement pattern to attempt to provide a good 

estimate for the MV's which fall outside of the first region 303, i.e., 
moderate motion. Since camera pans can introduce fast motion and 
are predominantly in the horizontal direction, the third region 305 
extends the motion estimation search horizontally. 

20 

A preferred boundary for the first region 303 is the MV 
region with the comer points MV(x, y) = (-7, -7), (-7, +7), (+7, +7), 
and (+7, -7). A preferred boundary for the second region 304 is the 
MV region with the comer points MV(x, y) = (-31, -31), (-31, +31), 

2 5 (+31, +31), and (+31, -31) and the comer points MV(x, y) = (-7, 

-7), (-7, +7), (+7, +7), and (+7, -7). A preferred boundary for the 
third region 305 is the MV region with the comer points MV(x, y) = 
(-48, -31), (-48, +31), (-31, +31), and (-31, -31) and the comer 
points MV(x, y) = (+31, -31), (+31, +31), (+48, +31), and (+48, 

3 0 -31). 

FIG. 4 illustrates preferred search point displacement patterns 
for the search regions 303, 304, 305. Within each search region a 
different search point displacement pattern is used. A search point is 
3 5. a specific MV for which the corresponding comparison video block 
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of the previous video frame 301 is compared to the present video 
block 302 of the present video frame 300. The search point 
displacement pattern is a two-dimensional spatial pattern of search 
points. A first search point displacement pattern 400 is used within 
5 the first region 303. This displacement pattern should be very dense 
to ensure that an excellent estimation is made in the case of the 
highly probable small motion. The preferred pattern is a 
rectangular grid of search points spaced one pixel apart. 

1 0 A second search point displacement pattern 401 is used within 

the second region 304. This displacement pattern should be 
moderately dense to ensure that a very good estimation is made for 
most occurrences of the less probable moderate motion. By using 
this pattern, some savings in computation is made while not affecting 
15 the perceived quality of the compressed video. The preferred 

pattern is a rectangular grid of search points spaced 2 pixels apart. 

A third search point displacement pattern 402 is used within 
the third region 305. Exploiting the facts that high motion tends to 

2 0 blur the image on the camera and that the human observer is less 

sensitive to resolution in fast moving objects, this displacement 
pattern can be less dense. By using a low density pattern, significant 
savings in computation is made, while providing good quality 
compressed video for video source 1 1 1 with fast horizontal motion. 
25 A preferred pattern is a rectangular grid of search points spaced 3 
pixels apart. Alternate preferred patterns include a rectangular grid 
of search points spaced by 3 pixels horizontally and 2 pixels 
vertically, a rectangular grid of search points spaced by 4 pixels 
horizontally and 2 pixels vertically, a rectangular grid of search 

3 0 points spaced by 2 pixels horizontally and 4 pixels vertically, and a 

diamond-shaped grid of search points spaced by (8) 1 / 2 on a 45 
degree angle. 

A fourth search point displacement pattern 403 is used within 
3 5 a fourth region 404. The fourth region 404 is selected surrounding 
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a first or temporary best match video block as determined from 
searches of the first three regions 303, 304, 305. The fourth search 
point displacement pattern 403 should be very dense to hone in on 
the best match video block. The preferred pattern is a rectangular 
5 grid of search points spaced 0.5 pixels apart. 

FIG. 5 illustrates a flow diagram of a preferred 
implementation of the method of estimating motion. A first series of 
comparisons is performed in a first region 500. MV's within the 
1 0 first region 303 are selected using the first search point displacement 
pattern 400. For each selected MV, the corresponding comparison 
video block of the previous video frame 301 is compared to the 
present video block 302 using a predetermined comparison 
measurement criteria. Typically the predetermined comparison 

1 5 measurement criteria is either the mean square error (MSE) or mean 

absolute difference (MAD) between the comparison video block and 
the present video block 302. The preferred comparison 
measurement criteria is the MAD. If the comparison measurement is 
better than the previous best comparison measurement, then the new 

2 0 MV and its comparison measurement are stored. 

Then a second series of comparisons is performed in a second 
region 501. MV's within the second region 304 are selected using 
the second search point displacement pattern 401. For each selected 

2 5 MV, the corresponding comparison video block of the previous 

video frame 301 is compared to the present video block 302 using 
the predetermined comparison measurement criteria. If the 
comparison measurement is better than the previous best comparison 
measurement, then the new MV and its comparison measurement are 

3 0 stored. 

Similarly, a third series of comparisons is performed in a 
third region 502. MV's within the third region 304 are selected 
using the third search point displacement pattern 402. For each 
3 5 selected MV, the corresponding comparison video block of the 
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previous video frame 301 is compared to the present video block 302 
using a predetermined comparison measurement criteria. If the 
comparison measurement is better than the previous best comparison 
measurement, then the new MV and its comparison measurement are 
5 stored. After the first three comparisons, the MV with the best 

comparison measurement is used to determine the first or temporary 
best match video block from the previous video frame 503. 

A fourth series of comparisons is performed for MV's within 
10 a region around the first best match video block 504. This step is 
optional, but preferred. A preferred boundary for this region is the 
MV region with the comer points MV(x, y) = (mvx-1, mvy-1), 
(mvx-1, mvy+1), (mvx+1, mvy+1), and (mvx+1, mvy-1), where 
mvx and mvy are the x and y components of the best MV determined 

1 5 in step 503. A very dense search point displacement pattern is used 

within this region. A preferred search point displacement pattern is 
0.5 pixels horizontally and 0.5 pixels vertically. For each MV in this 
search point displacement pattern, the corresponding comparison 
video block of the previous video frame 301 is compared to the 

2 0 present video block 302 using the predetermined comparison 

measurement criteria. If the comparison measurement is better than 
the previous best comparison measurement, then the new MV and its 
comparison measurement are stored. Finally, the MV with the best 
comparison measurement is used to determine the best match video 
25 block 505. 

FIG. 6 illustrates a flow diagram of a preferred 
implementation of a method for adapting the search region. First the 
best match video blocks are determined for a predetermined number 

3 0 of video blocks from the video source 600. The method illustrated 

in FIG. 5 may be used to determine the best match video block for 
each current video block. The predetermined number of video 
blocks could be anywhere from one video block to an entire video 
frame of video blocks, with a preferred number equal to one row of 
3 5 video blocks. For example, the predetermined number of video 
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blocks for a SEF-sized video source 111, could be from 1 to 330, 
with a preferred number of 22. 

After determining the best match video blocks for the 
5 predetermined number of video blocks 600, video block statistics are 
collected for the predetermined number of video blocks 601. The 
preferred set of statistics include intra-count, edge-count, and outer- 
zone-count. The intra-count statistic is the number of video blocks, 
within the predetermined number of video blocks, that the intra/non- 
1 0 intra determiner 113 chose to encode using intra coding. The edge- 
count statistic is the number of video blocks, within the 
predetermined number of video blocks, whose best match video 
block corresponded to a MV "near the edge" of the search region. 
"Near the edge" means within a predetermined number of pixels 

1 5 from the edge of the search region. In a preferred implementation, 

a MV within one pixel of the edge is considered "near the edge". 
The outer-zone-count statistic is the number of video blocks, within 
the predetermined number of video blocks, whose best match video 
block corresponded to a MV in the "outer zone" away from the 

2 0 center of the search region. The "outer zone" is the sub region 

within the current search region which is outside of the next smaller 
defined search region. 

The video block statistics are evaluated to determine if they 

2 5 are favorable 602. In a preferred implementation, each statistic is 

compared to a predetermined threshold value. The intra-count value 
is unfavorable if it exceeds some threshold. A high intra-count value 
indicates that the best match video blocks are not being used and 
therefore, the best match video blocks are of poor quality. The 

3 0 edge-count value is unfavorable if it exceeds some threshold. The 

outer-zone-count value is unfavorable if it is less than some 
threshold. Preferred values for a SIF video source for the intra- 
count threshold, the edge-count threshold, and the outer-zone-count 
threshold are respectively 3, 2, and 6. If all the video statistics are 
3 5 found to be favorable, then whether a predetermined time has 
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expired is checked 603; otherwise if the video statistics are found to 
be unfavorable, then the search region is changed 604. 

The time since the search region was last changed is compared 
5 to a predetermined time 603. If the predetermined time has expired, 
then the search region is changed 604; otherwise if the 
predetermined time has not expired, then the search region is not 
changed and step 600 is performed. The preferred predetermined 
time for a SEP video source is 88 video blocks. This is 4 times the 
1 0 preferred predetermined number of video blocks used in step 600. 

The search region and search point displacement patterns are 
changed based on whether the video block statistics are unfavorable 
or whether the predetermined time has expired 604. In a preferred 

1 5 implementation, if the intra-count is unfavorable, the edge-count is 

unfavorable, or the predetermined time has expired, then the search 
region area is increased. If the outer-zone-count is unfavorable, then 
the search region area is decreased. If the area of the search region 
is increased, then the density of the search point displacement 

2 0 patterns may be decreased, so that the total number of search points 

is approximately the same. Conversely, if the search region is 
decreased, the search point displacement patterns may be increased. 
Keeping the computations per second nearly constant allows an 
efficient, reconfigurable hardware implementation to handle the 

2 5 different search patterns. 

In a preferred implementation there are 3 predetermined 
search patterns, Pattern A, Pattern B, and Pattern C. Either of the 
patterns may be chosen in step 604. Pattern A has a first region 

3 0 determined by the MV region with the comer points MV(x, y) = (+/- 

7, +/-7), and a second region determined by the MV region with the 
comer points MV(x, y) = (+/-31, +/-31) minus the first region. The 
first search point displacement pattern, which is used in the first 
region, is a rectangular grid of search points spaced 1 pixel apart. 
3 5 The second search point displacement pattern, which is used in the 
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second region, is a rectangular grid of search points spaced 2 pixel 
apart. Pattern B has a first region determined by the MV region 
with the corner points MV(x, y) = (+/-7, +/-7), a second region 
determined by the MV region with the corner points MV(x, y) = (+/- 
5 20, +/-20) minus the first region, and a third region determined by 
the MV region with the corner points MV(x, y) = (+A48, +/-31) 
minus both the first and second regions. The first search point 
displacement pattern, which is used in the first region, is a 
rectangular grid of search points spaced 1 pixel apart. The second 
1 0 search point displacement pattern, which is used in the second 
region, is a rectangular grid of search points spaced 2 pixel apart. 
The third search point displacement pattern, which is used in the 
third region, is a rectangular grid of search points spaced 3 pixel 
apart. Pattern C has a first region determined by the MV region 

1 5 with the comer points MV(x, y) = (+/-7, +/-7) and a second region 

determined by the MV region with the comer points MV(x, y) = (+/- 
48, +/-31) minus the first region. The first search point 
displacement pattern, which is used in the first region, is a 
rectangular grid of search points spaced 1 pixel apart. The second 

2 0 search point displacement pattern, which is used in the second 

region, is a rectangular grid of search points spaced 3 pixel apart. 

With the present invention, adaptable and efficient motion 
estimation is accomplished. The present invention is much less 

2 5 complex than full search and hierarchical search methods, yet 

provides good video quality. For example, for a search region of 
MV(+/-48, +/-31), the present invention requires only 1389 
comparisons, which is a 77% savings in computations when 
compared with a full search algorithm, and a 18% savings in 

3 0 computations when compared with a hierarchical search algorithm. 

In addition, the present invention allows the search region to adjust 
to the video sequence being encoded while keeping the number of 
computations fairly constant. 
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Claims 

1. A method for estimating motion within a video sequence, 
5 the method comprising the steps of: 

1a) performing a first series of comparisons between a 
present video frame and a comparison video block of a 
previously stored video frame, wherein, for each comparison of 
10 the first series of comparisons, the comparison video block is 
incrementally positioned within a first region of the 
previously stored video frame based on a first region of the 
previously stored video frame based on a first search point 
displacement pattern; 

15 

1b) performing a second series of comparisons between the 
present video block and the comparison video block, wherein, 
for each comparison of the second series of comparisons, the 
comparison video block is incrementally positioned within a 
20 second region of the previously stored video frame based on a 
second search point displacement pattern, and wherein the 
second search point displacement pattern is greater than the 
first search point displacement pattern; 

25 1c) performing a third series of comparisons between the 
present video block and the comparison video block, wherein, 
for each comparison of the third series of comparisons, the 
comparison video block is incrementally positioned within a 
third region of the previously stored video frame based on a 

30 third region of the previously stored video frame based on a 
third search point displacement pattern is greater than the 
second search point displacement pattern; and 
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1d) from the first series of comparisons, the second series 
of comparisons and the third series of comparisons, 
determining a match video block. 

5 2. The method of claim 1 , wherein at least one of: 

2a) further comprising establishing the first region, 
the second region, and the third region of the previously stored 
video frame to have a rectangular geometry; 

2b) step (1a) further comprises incrementing the first 

10 search point displacement pattern by 1 pixel; 

2c) step (1b) further comprises incrementing the first 
search point displacement pattern by 2 pixels; 

2d) step (1c) further comprises incrementing the third 
search point displacement pattern by 3 pixels; 

15 2e) further comprising formatting the first region to 

have at least a 10 x 10 pixel area, formatting the second 
region encircling the first region to have at least a 40 x 40 
pixel area, and formatting the third region encircling the 
second region to have at least a 40 x 60 pixel area; 

20 , 2f) step (1d) further comprises the steps of: 

2f1) performing a fourth series of comparisons 
between the present video block and the comparison video 
block, wherein for each comparison of the fourth series of 
comparisons, the comparison video block is incrementally 

25 positioned within a region surrounding the match video block 
based on a fourth search point displacement pattern, and 
wherein the fourth search point displacement pattern is equal 
to or smaller than the search point displacement of the first 
search point displacement pattern; and 

30 2f2) determining a best match video block from 

the fourth series of comparisons; 
and where further selected, wherein step (1a) further 
comprises incrementing the fourth search point displacement 
pattern by 0.5 pixels. 



35 
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3. A method for estimating motion within a video sequence, 
the method comprising the steps of: 

3a) for a predetermined number of video blocks, performing a 
5 series of comparisons between a present video block of a 
present video frame and a comparison video block of a 
previously stored video frame over a search region to fine a 
best match video block; 

10 3b) collecting video block statistics for the predetermined 
number of video blocks; and 

3c) changing the search region when the video block 
statistics are unfavorable. 

15 

4. In the method of claim 3, wherein at least one of: 
4a) step (3c) further comprises changing the search 

region when quality of the best match video block is poor; 
4b) step (3c) further comprises changing the search 
20 region when the best match video block is near the perimeter 
of the search region; 

4c) step (3c) further comprises changing the search 
region when the best match video block is near the center of 
the search region; 
25 4d) step (3c) further comprises changing the search 

region when the search region has not been changed for a 
predetermined time; 

4e) step (3c) further comprises changing the search 
region to produce a changed search region, wherein the changed 
30 search region and the search region have approximately the 
same number of search points; 

5. A method for compressing video, the method comprising 
the steps of: 

35 
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5a) receiving a present video frame that includes a plurality 
of video blocks: 

for each video block of the plurality of video blocks: 

5 

5b) performing a first series of comparisons between the 
each video block of the present video frame and a comparison 
video block of a previously stored video frame, wherein, for 
each comparison of the first series of comparison video block 
10 is incrementally positioned within a first region of the 
previously stored video frame based on a first search point 
displacement pattern; 

5c) performing a second series of comparisons between the 
15 each video block and the compassion video block, wherein, for 
each comparison of the second series of comparisons, the 
comparison video block is incrementally positioned within a 
second region of the, previously stored video frame based on a 
second search point displacement pattern, and wherein the 
20 second search point displacement pattern is greater that the 
first search point displacement pattern; 

5d) performing a third series of comparisons between the 
each video block and the comparison video block, wherein, for 

25 each comparison of the third series of comparisons, the 

comparison video block is incrementally positioned within a 
third region of the previously stored video frame based on a 
third search point displacement pattern, and wherein the third 
search point displacement pattern is greater than the second 

30 search point displacement pattern; 

5e) from the first series of comparisons, the second series 
of comparisons, and the third series of comparisons, 
determining a best match video block; 



35 



WO 9530310 



PCT/US95/03766 



19 

5f) subtracting the best match video block from the each 
video block to produce a residual video block; 

5g) evaluating the best match video block based on the each 
5 video block; and 

5h) encoding the residual video block when the evaluation is 
favorable. 

10 6. The method of claim 5, step (5h) further comprising 
encoding the each video block when the evaluation is not 
favorable. 

7. The method of claim 5, wherein step (5e) comprises the 
15 steps of: 

5e1) determining a first best match video block from 
the first series of comparisons, the second series of 
comparisons and the third series of comparisons; 
5e2) performing a fourth series of comparisons 
between the each video block and the comparison video 
block, wherein for each comparison of the fourth series 
of comparisons, the comparison video block is 
incrementally positioned within a region surrounding the 
first best match video block based on a fourth search 
point displacement pattern, and wherein the fourth 
search point displacement pattern is equal to or smaller 
than the search point displacement of the first search 
point displacement pattern; and 

5e3) determining the best match video block from the 
fourth series of comparisons; 
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